智能论文笔记

Neuromuscular Control of the Face-Head-Neck Biomechanical Complex With Learning-Based Expression Transfer From Images and Videos

Xiao S. Zeng , Surya Dwarakanath , Wuyue Lu , Masaki Nakada , Demetri Terzopoulos

分类：计算机视觉

2021-11-12

从人们到3D面部模型的面部表情转移是一种经典的计算机图形问题。在本文中，我们提出了一种基于学习的基于学习的方法，将来自图像和视频从图像和视频转移到面部头颈络合物的生物力学模型。利用面部动作编码系统（FACS）作为表达空间的中间表示，我们训练深度神经网络，采用FACS动作单元（AUS），并为肌肉骨骼模型输出合适的面部肌肉和钳口激活信号。通过生物力学模拟，激活变形了面部软组织，从而将表达转移到模型。我们的方法具有比以前的方法相比。首先，面部表情是剖贯的一致，因为我们的生物力学模型模拟了面部，头部和颈部的相关解剖结构。其次，通过使用从生物力学模型本身产生的数据训练神经网络，我们消除了数据收集的表达式转移的手动努力。通过涉及转移到面部表情和头部姿势的实验，通过实验证明了我们的方法的成功。

translated by 谷歌翻译

Integrating Heterogeneous Domain Information into Relation Extraction: A Case Study on Drug-Drug Interaction Extraction

Masaki Asada

分类：自然语言处理

2022-12-21

The development of deep neural networks has improved representation learning in various domains, including textual, graph structural, and relational triple representations. This development opened the door to new relation extraction beyond the traditional text-oriented relation extraction. However, research on the effectiveness of considering multiple heterogeneous domain information simultaneously is still under exploration, and if a model can take an advantage of integrating heterogeneous information, it is expected to exhibit a significant contribution to many problems in the world. This thesis works on Drug-Drug Interactions (DDIs) from the literature as a case study and realizes relation extraction utilizing heterogeneous domain information. First, a deep neural relation extraction model is prepared and its attention mechanism is analyzed. Next, a method to combine the drug molecular structure information and drug description information to the input sentence information is proposed, and the effectiveness of utilizing drug molecular structures and drug descriptions for the relation extraction task is shown. Then, in order to further exploit the heterogeneous information, drug-related items, such as protein entries, medical terms and pathways are collected from multiple existing databases and a new data set in the form of a knowledge graph (KG) is constructed. A link prediction task on the constructed data set is conducted to obtain embedding representations of drugs that contain the heterogeneous domain information. Finally, a method that integrates the input sentence information and the heterogeneous KG information is proposed. The proposed model is trained and evaluated on a widely used data set, and as a result, it is shown that utilizing heterogeneous domain information significantly improves the performance of relation extraction from the literature.

translated by 谷歌翻译

Estimating truncation effects of quantum bosonic systems using sampling algorithms

Masanori Hanada , Junyu Liu , Enrico Rinaldi , Masaki Tezuka

分类：人工智能 | 机器学习

2022-12-16

To simulate bosons on a qubit- or qudit-based quantum computer, one has to regularize the theory by truncating infinite-dimensional local Hilbert spaces to finite dimensions. In the search for practical quantum applications, it is important to know how big the truncation errors can be. In general, it is not easy to estimate errors unless we have a good quantum computer. In this paper we show that traditional sampling methods on classical devices, specifically Markov Chain Monte Carlo, can address this issue with a reasonable amount of computational resources available today. As a demonstration, we apply this idea to the scalar field theory on a two-dimensional lattice, with a size that goes beyond what is achievable using exact diagonalization methods. This method can be used to estimate the resources needed for realistic quantum simulations of bosonic theories, and also, to check the validity of the results of the corresponding quantum simulations.

translated by 谷歌翻译

Multi-objective Tree-structured Parzen Estimator Meets Meta-learning

Shuhei Watanabe , Noow Awad , Masaki Onishi , Frank Hutter

分类：机器学习 | 人工智能

2022-12-13

Hyperparameter optimization (HPO) is essential for the better performance of deep learning, and practitioners often need to consider the trade-off between multiple metrics, such as error rate, latency, memory requirements, robustness, and algorithmic fairness. Due to this demand and the heavy computation of deep learning, the acceleration of multi-objective (MO) optimization becomes ever more important. Although meta-learning has been extensively studied to speedup HPO, existing methods are not applicable to the MO tree-structured parzen estimator (MO-TPE), a simple yet powerful MO-HPO algorithm. In this paper, we extend TPE's acquisition function to the meta-learning setting, using a task similarity defined by the overlap in promising domains of each task. In a comprehensive set of experiments, we demonstrate that our method accelerates MO-TPE on tabular HPO benchmarks and yields state-of-the-art performance. Our method was also validated externally by winning the AutoML 2022 competition on "Multiobjective Hyperparameter Optimization for Transformers".

translated by 谷歌翻译

Efficient stereo matching on embedded GPUs with zero-means cross correlation

Qiong Chang , Aolong Zha , Weimin Wang , Xin Liu , Masaki Onishi , Lei Lei , Meng Joo Er , Tsutomu Maruyama

分类：计算机视觉

2022-12-01

Mobile stereo-matching systems have become an important part of many applications, such as automated-driving vehicles and autonomous robots. Accurate stereo-matching methods usually lead to high computational complexity; however, mobile platforms have only limited hardware resources to keep their power consumption low; this makes it difficult to maintain both an acceptable processing speed and accuracy on mobile platforms. To resolve this trade-off, we herein propose a novel acceleration approach for the well-known zero-means normalized cross correlation (ZNCC) matching cost calculation algorithm on a Jetson Tx2 embedded GPU. In our method for accelerating ZNCC, target images are scanned in a zigzag fashion to efficiently reuse one pixel's computation for its neighboring pixels; this reduces the amount of data transmission and increases the utilization of on-chip registers, thus increasing the processing speed. As a result, our method is 2X faster than the traditional image scanning method, and 26% faster than the latest NCC method. By combining this technique with the domain transformation (DT) algorithm, our system show real-time processing speed of 32 fps, on a Jetson Tx2 GPU for 1,280x384 pixel images with a maximum disparity of 128. Additionally, the evaluation results on the KITTI 2015 benchmark show that our combined system is more accurate than the same algorithm combined with census by 7.26%, while maintaining almost the same processing speed.

translated by 谷歌翻译

Consistency between ordering and clustering methods for graphs

Tatsuro Kawamoto , Masaki Ochi , Teruyoshi Kobayashi

分类：机器学习

2022-08-27

通常，通过聚类或订购将标签分配给每个元素，通常可以分析关系数据集。尽管通过聚类和排序方法可以实现数据集的类似表征，但前者比后者更积极地研究了数据集，尤其是对于表示为图的数据。这项研究通过研究几种聚类和订购方法之间的方法学关系来填补这一空白，重点是光谱技术。此外，我们评估了聚类和订购方法的结果性能。为此，我们提出了一种称为标签连续性误差的度量，该度量通常量化了一组元素的序列和分区之间的一致性程度。基于合成和现实世界数据集，我们评估了订购方法标识模块结构和聚类方法标识带状结构的范围。

translated by 谷歌翻译

HTML版本

UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture

Hiroyasu Akada , Jian Wang , Soshi Shimada , Masaki Takahashi , Christian Theobalt , Vladislav Golyanik

分类：计算机视觉

2022-08-02

我们提出Unrealego，即，一种用于以Egentric 3D人类姿势估计的新的大规模自然主义数据集。Unrealego是基于配备两个鱼眼摄像机的眼镜的高级概念，可用于无约束的环境。我们设计了它们的虚拟原型，并将其附加到3D人体模型中以进行立体视图捕获。接下来，我们会产生大量的人类动作。结果，Unrealego是第一个在现有的EgeCentric数据集中提供最大动作的野外立体声图像的数据集。此外，我们提出了一种新的基准方法，其简单但有效的想法是为立体声输入设计2D关键点估计模块，以改善3D人体姿势估计。广泛的实验表明，我们的方法在定性和定量上优于先前的最新方法。Unrealego和我们的源代码可在我们的项目网页上找到。

translated by 谷歌翻译

Dynamic Shielding for Reinforcement Learning in Black-Box Environments

Masaki Waga , Ezequiel Castellano , Sasinee Pruekprasert , Stefan Klikovits , Toru Takisaka , Ichiro Hasuo

分类：机器学习

2022-07-27

由于学习过程中缺乏安全保证，在网络物理系统中使用加固学习（RL）是具有挑战性的。尽管有各种建议在学习过程中减少不希望的行为，但这些技术中的大多数都需要先前的系统知识，并且其适用性是有限的。本文旨在减少学习过程中不希望的行为，而无需任何先前的系统知识。我们提出动态屏蔽：基于自动机学习的基于模型的安全RL技术的扩展。动态屏蔽技术使用RPNI算法的变体和RL平行构建近似系统模型，并由于学习模型构建的屏蔽而抑制了不希望的探索。通过这种组合，在代理商体验他们之前，可以预见潜在的不安全行动。实验表明，我们的动态盾牌可显着减少训练过程中不希望的事件的数量。

translated by 谷歌翻译

BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning

Yi Gu , Yoshito Otake , Keisuke Uemura , Mazen Soufi , Masaki Takao , Nobuhiko Sugano , Yoshinobu Sato

分类：计算机视觉

2022-07-07

我们提出了一种从普通X射线图像中估算骨矿物质密度（BMD）的方法。双能X射线吸收法（DXA）和定量计算机断层扫描（QCT）在诊断骨质疏松症方面具有很高的精度；但是，这些方式需要特殊的设备和扫描协议。测量X射线图像的BMD提供了机会筛查，这对于早期诊断可能有用。先前直接了解X射线图像和BMD之间关系的方法需要大型训练数据集，以实现高精度，因为X射线图像中的强度很大。因此，我们提出了一种使用QCT训练生成对抗网络（GAN）的方法，并将X射线图像分解为骨分割QCT的投影。提出的分层学习提高了定量分解小区域目标的鲁棒性和准确性。使用拟议的方法对200例骨关节炎评估，我们将其命名为BMD-GAN，在预测和地面真实DXA测量的BMD之间显示出Pearson相关系数为0.888。除了不需要大规模训练数据库外，我们方法的另一个优点是它的扩展性对其他解剖区域，例如椎骨和肋骨。

translated by 谷歌翻译

Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

Masaki Adachi , Satoshi Hayakawa , Martin Jørgensen , Harald Oberhauser , Michael A. Osborne

分类：机器学习 | (统计)机器学习

2022-06-09

贝叶斯后期和模型证据的计算通常需要数值整合。贝叶斯正交（BQ）是一种基于替代模型的数值整合方法，能够具有出色的样品效率，但其缺乏并行化阻碍了其实际应用。在这项工作中，我们提出了一种并行的（批次）BQ方法，该方法采用了核正素的技术，该技术具有证明是指数的收敛速率。另外，与嵌套采样一样，我们的方法允许同时推断后期和模型证据。重新选择了来自BQ替代模型的样品，通过内核重组算法获得一组稀疏的样品，需要可忽略的额外时间来增加批处理大小。从经验上讲，我们发现我们的方法显着优于在包括锂离子电池分析在内的各种现实世界数据集中，最先进的BQ技术和嵌套采样的采样效率。

translated by 谷歌翻译